# XLSR fine-tuning

Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V4

An automatic speech recognition model fine-tuned from gary109/ai-light-dance_stepmania_ft_wav2vec2-large-xlsr-53-v3 on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset.

Speech Recognition

Wav2vec2 Large Ru Golos

A Russian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Sberdevices Golos dataset. It expects 16 kHz audio input.

Speech Recognition

Transformers Other

This is an Indonesian automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on a public Indonesian speech dataset.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Es Col Pro Noise

A Spanish speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-spanish, optimized for Colombian accent and noisy environments

Speech Recognition

Wav2vec2 Large Xlsr Es Col Pro

A Spanish (Colombian accent) speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-spanish.

Speech Recognition

Wav2vec2 Large Xlsr Es Col Test

A Spanish speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-spanish on a specific dataset.

Speech Recognition

Wav2vec2hindiasr

Hindi automatic speech recognition (ASR) model based on Wav2Vec2 architecture, fine-tuned on public speech datasets

Speech Recognition

Wav2vec2 Large Xlsr 53 English

An English speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained on the Common Voice 6.1 dataset

Speech Recognition English

Wav2vec2 Large Xlsr 53 Slovenian

This is a Slovenian automatic speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, trained on the Common Voice dataset with a word error rate of 36.04%.

Speech Recognition Other

Wav2vec2 Large Xlsr Kazakh

This is a Kazakh automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Kazakh speech corpus v1.1 with a test WER of 19.65%.

Speech Recognition Other

Wav2vec2 Large Xlsr Kyrgyz

This is an automatic speech recognition model fine-tuned on the Kyrgyz Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Ukrainian

A Ukrainian automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on the Common Voice dataset.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Breton

A Breton speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53.

Speech Recognition Other

Wav2vec2 10july

This is a German automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on the Common Voice German dataset.

Speech Recognition

Transformers German

Wav2vec2 Large Xlsr 53 Hungarian

This is a Hungarian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Eu

A Basque automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a 15.34% word error rate (WER) on the Common Voice Basque test set.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Turkish Artificial

This is a Turkish speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained using an artificial Common Voice dataset.

Speech Recognition Other

Wav2vec2 Large Xlsr Cnh

A Hakha Chin speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained on the Common Voice dataset with a test WER of 31.38%.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Polish

A Polish automatic speech recognition model fine-tuned from the large XLSR-53 model, facebook/wav2vec2-large-xlsr-53.

Speech Recognition Other

Wav2vec2 Large Xlsr Estonian

This is an Estonian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Irish

An Irish speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset.

Speech Recognition

Wav2vec2 Large Xlsr Hindi Commonvoice

This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily used for Hindi speech recognition tasks.

Speech Recognition

A Greek automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Demo Colab

This model is a speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-large-xlsr-53, primarily used for robust speech event recognition.

Speech Recognition

Wav2vec2 Large Xlsr 53 Latvian

This is an automatic speech recognition (ASR) model fine-tuned on the Latvian Common Voice dataset based on Facebook's Wav2Vec2-Large-XLSR-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Rm Vallader

A fine-tuned speech recognition model for the Romansh Vallader dialect based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 32.89%

Speech Recognition

An Indonesian automatic speech recognition (ASR) model based on the XLSR architecture, fine-tuned on the Common Voice Indonesian dataset.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Nahuatl

A Nahuatl (ncj dialect) speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53.

Speech Recognition

Wav2vec2 Large Xlsr Georgian

A Georgian automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53. It expects audio sampled at 16 kHz.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Mongolian

An automatic speech recognition model fine-tuned on the Mongolian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition Other

Wav2vec2 Large Xlsr Javanese

A Javanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on high-quality Javanese TTS data from OpenSLR.

Speech Recognition Other

Wav2vec2 Large Xlsr Sundanese

A Sundanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on high-quality TTS data from OpenSLR

Speech Recognition Other

Wav2vec2 Large Xlsr Hungarian

This is an automatic speech recognition (ASR) model fine-tuned on the Hungarian Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Tatar

A Tatar automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53. It expects audio sampled at 16 kHz.

Speech Recognition Other

Wav2vec2 Large Xlsr Coraa Portuguese Cv8

A Portuguese speech recognition model fine-tuned from Edresson/wav2vec2-large-xlsr-coraa-portuguese on the Common Voice dataset.

Speech Recognition

Wav2vec2 Large Xlsr 53 Kyrgyz

This is a Kyrgyz automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using public speech datasets.

Speech Recognition Other

Wav2vec2 Large Xlsr Punjabi

This is an automatic speech recognition (ASR) model fine-tuned on Punjabi speech data based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition

Wav2vec2 Large Xlsr Arabic

An Arabic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Arabic Common Voice dataset.

Speech Recognition

Transformers Arabic

Wav2vec2 Large Xlsr 53 Turkish

A Turkish speech recognition model fine-tuned on the Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Ia

An Interlingua speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, achieving a 22.08% word error rate on the Common Voice Interlingua dataset.

Speech Recognition Other
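
Several entries above quote a word error rate (WER). As a rough guide to reading those figures, WER is usually defined as the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the model's output, divided by the number of words in the reference. The following is a minimal sketch of that definition, not the evaluation script used for any model listed here. The function name `word_error_rate` is illustrative, and splitting on whitespace with no text normalization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly,
    with no casing or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

For example, `word_error_rate("the cat sat on the mat", "the cat sit on mat")` counts one substitution and one deletion against six reference words, giving 2/6, or about 33%. Because insertions are counted, WER can exceed 100%, and figures computed with different text normalization are not directly comparable.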
